Compactness Properties 
of Weighted Summation Operators on Trees 



Mikhail Lifshits Werner Linde 

January 29, 2011 

Abstract 

We investigate compactness properties of weighted summation operators Va^cr as mapping 
from £i{T) into £q{T) for some q G {l,oo). Those operators are defined by 



syt 

where T is a tree with induced partial order t ^ s (or s ^ t) for t, s £ T. Here a and a 
are given weights on T. We introduce a metric d on T such that compactness properties of 
(T, d) imply two-sided estimates for en{Va,a-), the (dyadic) entropy numbers of Va,cr- The 
results are applied for concrete trees as e.g. moderate increasing, biased or binary trees and 
for weights with a{t)a{t) decreasing either polynomially or exponentially. We also give some 
probabilistic applications for Gaussian summation schemes on trees. 
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1 Introduction 



This work essentially stems from the article [13] where the entropy of linear Volterra integral 
operators was studied in a difficult critical case. Handling this case required a new technique 
and it turned out that this technique could be cleanly elaborated and better explained if we 
replace the Volterra operator by an analogous summation operator on the binary tree. Notice 
that trees appear naturally in the study of functional spaces because the Haar base and other 
similar wavelet bases indeed have a structure close to that of a binary tree. 

The class of summation operators on trees is quite simple and natural but it is absolutely not 
investigated and we believe that a deeper study of its properties, as presented here, is not only 
interesting in its own right but might also be helpful as a model for studying more conventional 
classes of operators. 

Thus let T be a finite or infinite tree and let ,, ^ " be the partial order generated by its 
structure, i.e. t < s means that t is situated on the way leading from the root of the tree to s. 
IfA;:TxTi— T-Misa kernel satisfying 

sup V|A:(t,s)|^ < oo (1.1) 

for some q G [l,oo), then the Volterra-type summation operator V with 

{Vx){t) :=Y,k{t,s)x{s), teT, 

is bounded from ii{T) into iq{T). Compactness properties of V surely depend on the kernel 
k as well as on the structure of the underlying tree. It seems to be hopeless to describe such 
properties of V in this general context. A first step could be the investigation of this problem in 
the case of special kernels k (and for quite general trees). Thus we restrict ourselves to kernels 
k which may be written as k{t,s) = a{t)a{s) for some given weights a, a : T i— >• (0, oo) where 
we assume that a is non-increasing. Condition (|1.1|) reads then as 

y a(r)5 I ct(s) <oo (1.2) 



sup 

sGT 



and V = Vaa acts as 



{Va,ax){t) = a{t) V a{s)x{s) , teT . (1.3) 



Note that adding signs to a and a does not change compactness properties (as well as any other 
property), thus assuming positive weights we do not lose the generality. 

In the linear case T = No, those weighted summation operators have been investigated in [B]. 
The main observation in this paper was that such operators may be regarded as special weighted 
integration operators and, consequently, their properties follow by those for integration operators 
as proved in [8], [9], [H], [H], etc. 
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The situation is completely different for general trees. Here an application of known results 
about Volterra integration operators is not available. Therefore summation operators in this 
general context have to be treated independently and new interesting phenomena appear because 
the structure of the underlying tree plays an important role. 

The main objective of the present paper is to investigate compactness properties of operators 
Va,a defined in Our basic observation is as follows. Suppose we are given weights a and 

a satisfying (|1.2p with a non-decreasing and let q G (l,oo). If t ^ s are in T we define their 
distance by 



d{t,s) := max «( 




Then d may be extended to a metric d on T. Let N{T,d,£) be the covering numbers of {T,d), 



I.e., 



N{T,d,£) := inf < 



n > 1 : T = IJ B,{tj 



with (open) e-balls Bi,{tj) for certain tj G T. We prove that the behavior of N{T, d,e) as e — )• 
is tightly connected with the degree of compactness of Va^a- More precisely, let eniVa^a) be the 
sequence of dyadic entropy numbers defined by 

e-n{ya,a) '■= inf [e > : {Va^aX : \\x\\i < 1} is covered by 2"~^ open e-balls in iq{T)] 

(we refer to [5J for more information about entropy numbers). We prove that the upper (lower) 
bounds for N(T,d,e) yield upper (lower) bounds for eniVa^a)- 

For example, as shown in Theorem 14.41 given a > and 6 > it follows that 

N{T,d,£) < ce-" |loge|^ implies en(K.,<x) < c n-^/^-^/P' {logn f" 

with p := mm{2,q} and 1/p' = 1 — 1/p. In Theorem 15.31 we prove a similar result assuming 
N{T,d,e) > ce~" |loge|''. In particular, if 1 < g < 2, then 

N{T,d,e) ^ e-" \loge\^ implies en{Va,a) ~ n-^/^-^/'^' (logn)^/^ . 

We also treat the case that N{T,d,e) increases exponentially. Besides some critical case, sharp 
estimates are obtained as well. 

Thus in order to get precise estimates for eniVa^a) it suffices to describe the behavior of 
N{T, d, e) in dependence of properties of the weights a and a and of the structure of the tree. 
This question is investigated in Sections E] and El Here we prove quite precise estimates for 
N{T, d, e) in the case of moderate trees (the number of nodes in the n-th generation increases 
polynomially) or for binary trees provided we know something about the behavior of a{t)(T{t). 
In Section [8] we investigate a class of trees where the branches die out very quickly. Here the 
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behavior of N(T, d, e) is completely different from the one observed for trees where each node 
has at least one offspring. This example demonstrates the influence of the tree structure to 
compactness properties of Va^a- 

In SectionOwe sketch a probabilistic interpretation of our results by providing the asymptotic 
of small deviation probabilities for some tree-indexed Gaussian random functions and at the 
end in Section [10] we state some open problems related to the topic of the present paper. 

Let us finally mention that throughout this paper we always denote by c or C (with or 
without subscript) universal constants which may vary even in one line. The constants may 
depend on q but neither on n nor on the behavior of the weights. 

2 Trees 

Let us recall some basic notations related to trees which will be used later on. In the sequel T 
always denotes a finite or an infinite tree. We suppose that T has a unique root which we denote 
by and that each element t £ T has a finite number ^(t) of offsprings. Thereby we do not 
exclude the case ^(t) = 0, i.e., some elements may "die out". The tree structure leads in natural 
way to a partial order ,, ^ " by letting t ^ s and s ^ t provided there are t = to,ti, . . . ,tm = s 
in T such that for 1 < j < m the element tj is an offspring of tj-i- The strict inequalities have 
the same meaning with the additional assumption t ^ s. Two elements t,s £ T are said to be 
comparable provided that either t < s or s ^ t. Otherwise we say that t and s are incomparable. 
For t,s £ T with t ^ s the order interval [t, s] is defined by 

[t,s] := {r £T -.t ^ s} 

and in a similar way we construct (t, s]. 

A subset S C T is said to be a branch provided that all elements in B are comparable and, 
moreover, if t ^ r ^ s with t,s G B, then this implies r £ B as well. Of course, finite branches 
are of the form [t, s] for suitable t < s. 

Given s G T its order |s| > is defined by 

\s\ := i^{t £T :t ^ s} . 

Then 

R(n) := #{t £T :\t\ =n} , n>0, 
is the number of elements in the n-th generation of T. 

3 Metrics and £— Nets on Trees 

Suppose we are given two weight functions a : T i— (0, oo) and ex : T i— t- (0, oo) where we assume 
that a is non-increasing, i.e., if f ^ s, then it follows that a{t) > o"(s). 
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Given q G [1, oo) and t,v G T with t ^ v, we set 





(3.1) 



We let d{t, s) := d{s, t) provided that t ^ s and 

d{t, s) := d{t As,t)+ d{t A s, s) 

whenever t and s are incomparable. Here t A s denotes the infimum of t and s which may be 
defined as the maximal element in [0, t] n [0, s\. 

Remark 3.1 Since a is assumed to be non-increasing it follows that for t < s 



A similar expression (for weights and intervals on M) played an important role in the investigation 
of weighted integration operators (c.f. [8], [9], [11] and [E]). 

Proposition 3.2 The mapping d constructed above is a metric on T possessing the following 
monotonicity property: Whenever t' ^ t ^ s ^ s' , then d{t, s) < d{t', s'). 

Proof: The monotonicity property is a direct consequence of the definition of d. 

Clearly we have d{t, s) > and since we assumed a{t) > for t £ T we see that d{t, s) = 
implies t = s. By the construction we also have d{t,s) = d{s,t), thus it remains to prove the 
triangular inequality 



whenever t,s,r G T. To verify this inequality one has to treat separately six different cases in 
dependence of the relation between t, s and r. Among them only one is non-trivial, namely, the 
case that t, s and r are on a common branch and satisfy t ^ r ^ s or, equivalently, s ^ r ^ t. 
Therefore we only include the proof in that situation. 

Assume t ^r < s and choose t; in T with t ~< v < s where the maximum in (j3.ip is attained. 
Then we have to distinguish between the two following cases: v <r and r -< f . 

In the first case we have 

d{t,s) = \\al(^t,v\\\q o-(w) < ^max^|||al(fy]||^ cf{v')^ = d{t,r) < d{t,r) + d{r,s), 
and we are done. 




d{t,s) < d{t,r) +d{r,s) 
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Suppose now r ~< v. Here we argue as follows: 

d{t,s) = ||al(t^„]||^ a{v) < (j\al(^t,r]\\q + criv), 
and since a is non-increasing, it follows that 

d{t, s) < ||a l{t,r] 11^ (^{r) + ||a l(r,,;] < d{t, r) + d{r, s), 

as asserted. Thus the proof is completed. □ 

Our next objective is to investigate e-nets for T w.r.t. the metric d possessing an additional 
useful property. Given e > 0, a set S C T is said to be an order e-net provided that for each 
t G T there is a s S 5 satisfying d{s, t) < e as well as s ^ t. The corresponding order covering 
numbers of T are then 

N{T, d, e) := inf {#5 : 5 is an order e~net of T] . (3.2) 
Recall that the usual covering numbers N{T, d,e) were defined by 



N{T,d,e) = mfl#S:ScT,T=]J Be{s) i 



where -B£(s) is the open e-ball centered at s £ T, i.e. 

B,{s) := {r e r : d{r,s) < e} . 

Clearly we have 

N{T,d,e) < N{T,d,e) , 

but as we shall see now, a slightly weaker reverse estimate is valid as well. More precisely we 
have the following. 

Proposition 3.3 Let d be the metric defined in l\3.1\i . Then for any e > 0, it is true that 

N{T,d,2s) < N{T,d,e) . (3.3) 

Proof: Take any e-net S C T. For each s £ S we choose G B^{s) such that A s is the 
minimal element in {r A s : r G i?£(s)}. Then we have 

r^As^rAs^s, 

whenever r € B^{s). 
Set 

S := {rs A s : s e S} . 
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Clearly, we have ij^S < #5, hence it suffices to prove that S is an order 2e-net of T. To this 
end take any t €z T. Then there is an s G such that t G B^{s) and by the choice of it follows 
that TgAs^tAs^t. Thus it remains to estimate the distance between As and t. Note that 
the definition of d implies d{rs A s, s) < d{rs, s) < e. Thus the triangle inequality leads to 

d{rs As,t) < d{rs As,s) + d{t, s) < 2e 

because of t G B^^s). This completes the proof. □ 



4 Upper Entropy Estimates for Weighted Summation Opera- 
tors 

Here and later on the basic assumption about the weight functions a and a is that they satisfy 
(|1.2|) for some fixed q G (1, oo) and that a is non-increasing. 

In a first step we investigate weights a attaining only values in {2"™" : m G Z}. Without 
losing generality assume a{0) = 1, hence there are subsets Im T, m > 0, such that 

oo 

a(.) = 5^2-™ s€T. (4.1) 

Since a is supposed to be non-increasing, the sets Im possess the following properties: 

1. It holds T = IJm=o ^^^'^ Ii Im = ^ provided that / / m. 

2. Whenever B d T is a, brcinch., then for each, in ^ either B n — or it is an order 
interval in T. Furthermore, ii I < m, t £ B Ii, s £ B D Im, then this implies t ~< s. 

Define an operator W on ii{T) by 

{Wx){t) := a{t) a{s)x{s) = a{t) 2"™ ^ x{s) , t e Im ■ (4.2) 

The mapping W acts as a "partial" weighted summation operator depending on the partition 
{Im)m>o- We claim that condition ()1.2p implies that is a bounded operator from £i(T) into 
£q(r). To see this define the unit vectors 5t G h{T), t G T, by 



Stir) :-- 

Then 



1 : r = t 
: r / t . 



W{6t) = ^ a(r) a{r) 6r = ^ a(r) 6r , t e Im , 
hence (|1.2p implies sup^^^ ll^('^t)llg < ^ and W : £i{T) i-)- £q{T) is well-defined and bounded. 
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Define the set Ew ^ iq{T) by 

Ew := {Wi6t) -.tGT} 

and let the metric d on T be as in (|3.ip with weights a and a satisfying (|1.2p and ()4.ip . 
respectively. Then the following holds. 

Proposition 4.1 We have 

NiEw,\\ - lU,^) <N{T,d,e) + l . 

Proof: Fix e > and choose an arbitrary order e-net 5 in T (w.r.t. the metric d). Given t £ T, 
there is a unique m > with t £ Im- By definition we find an s G S satisfying d{s, t) < e as well 
as s ^ t. Assume first that s G 1^ as well. Then we get 

\\W{6t)-W{6s)\\^= a{rA • 2— = ||a ||^ < t) < e . 

Otherwise, if s G /; for a certain I < m, we argue as follows: 

1/9 



\\WiSt)\\^ = 
Consequently, the set 



E • 2-™ < airy . ait) < dis, t) < e 



{Wi5s) : s G 5} U {0} 

is an e-net of E\y in iqiT). This being true for any order net S completes the proof. □ 

Proposition 4.2 For q G (l,oo) let p := min{g, 2} and 1/p' := 1 — 1/p. Furthermore let a > 
and < 5 < oo . If we have 

N{T,d,e) <ce-^\loge\\ (4.3) 

then this implies 

en{W : hiT) ^ IqiT)) < c' n'^^^-^^'P' ilogn)^/^ . (4.4) 
// instead of ( [^.5] ) we only have 

logiV(r,d,e) < ce-" , (4.5) 



then we get 



eniW : hiT) ^ iq{T)) < u-^'p' ilogn)^/^' -^''^ (4.6) 



whenever a < p' while for a > p' we have 

eniW ■.£i{T)^£giT))<c'n-'/''. (4.7) 
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Proof: If we assume (j4.3p , by Proposition 14.11 we also have 

N{EwA\-\\g,^) <ce-''\loge\' . (4.8) 

Observe that en{W) = en{aco{E]y)) , where aco(-B) denotes the absolutely convex hull of a set 
B Iq (T) . Thus we may use known estimates for the entropy of absolutely convex hulls as can 
be found in or [19]. For example, assuming (|4.8p we may apply Corollary 5 in [19]. Recall 
that iq{T) is of type p with p = min{2, q}. Hence we get 

e„,{W) = en{aco{Ew)) < c' n-^/''~^/P' (logn)''/'' 

which completes the proof of ()4.4p . 

Assuming ()4.5p estimates ()4.6p and ()4.7p follow by similar arguments using Corollaries 4 and 
3 in [1^, respectively. □ 

Our next objective is to apply the previous results to weighted summation operators. To 
this end let a and a be two weight functions satisfying (|1.2p . Here a is an arbitrary non- 
increasing weight. Then we define the weighted summation operator Va,a as in (jl.Sp . Under 
the assumptions on the weights the operator Va,a is well-defined and bounded from ii{T) into 

The main goal is to relate the degree of compactness of Va,a with the behavior of N{T, d, e) 
as e — >• 0. Here the metric d is defined as in (j3.ip by a and cr. In a first step we suppose that a 
is of the special form ()4.ip with sets Im defined there. 

Given t € T, set 

Kt■.= {k>^■.hr^[Q,t]^%} . 

Consequently, if e Kt^ then 1^ n [0,t] = [Afc(t), ^fc(t)] for some \k{t) ■< 0k{t) ^ t. Note that 
^m(i) = t for t S /m and 

[0,t] = \J [Xk{t),ek{t)]. (4.9) 

keKt 

Define now an operator Z : li (T) i— )■ ii (T) by 

Z{6t) := 2^-^5e,(^t) , tGlm. (4.10) 

keKt 

Proposition 4.3 Assume ([23) and (jQ and define W : ii{T) ^ £g{T) and Z : li{T) H> ii{T) 
as in d^.^p and ( |^. iOp , respectively. Then Z is bounded with \\Z\\ < 2 and, moreover, the operator 
Va^a given by () i.gp admits a decomposition 

Va,a = WoZ. (4.11) 

Proof: By the construction, for each t £ Im we have 

m 

||^(<5t)||i< ^ 2^=— <^2'=-™<2, 

keKt k=0 
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hence, in view of \\Z\\ = sup^gj. this imphes \\Z\\ < 2 as asserted. 

To prove (j4.1ip first note that for t ^ T and k £ Kt we get 

re[Afc{t),efe(t)l r-G[Afe(t),6»fe(t)] 



hence, if t G /m, then by ()4.9p this imphes 



W{Z{6t)) = ^ 2^ 



2-'' "(^)'^- 



2-™ "(^)'^- 

re[0,t] 



On the other hand, 
and it fohows that 

VaA^t) = W{Z{6t)) . 

This being true for any t £ T proves (|4.1ip . □ 

Theorem 4.4 Let a and a be weight functions satisfying {\1.2^ where a is arbitrary non- 
increasing weight. If 

N{T,d,e) < ce~''|loge|'' 
for some a > and b > 0, then this implies 

with p as in Proposition \4-S\ If 



log N{T, d, e) <c£-'' (4.12) 

we get 

whenever a < p' while for a > p' we have 

CniVa,^ : hiT) ^ £g{T)) < cn^^/'^. 
Proof: Suppose as before cj(0) = 1 and for m > define subsets /m ^ ^ by 

I^:={teT: 2-""-^ < a{t) < 2-™} . 

If 

a(t):= 5]2-™l,„(t), teT, 

m>0 

then (T is a non-increasing weight function as in (|4.ip . By the construction it foUows that 

a{t) <a{t) <2a{t), teT. (4.13) 
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Define the metrics d and d as in ()3.ip by a and by a or a, respectively. In view of ()4.13p we get 

d{t,s) < d{t,s) <2d{t,s) , t,s£T, 

hence 

N{T, d, e/2) < N{T, d, e) 

which implies 

N{T,d,s) < ce-^\loge\'' 
as well. But now we are in the situation of Proposition 14.21 and obtain 

en{W : h{T) ^ £q{T)) < cn'^/'^-^/P' (logn)^/'' . (4.14) 
An application of Proposition 14.31 yields now 

en{V^,a) = en{W o Z) < en{W)\\Z\\ < 2en{W) , 

hence by ()4.14p it follows that also 

e„(K.,^)<cn-i/"-i/*''(logn)''/V 

To complete the proof note that (|4.13p implies that the diagonal operator A : ii{T) i— )• ii{T) 
defined by 

^ ' a{t) 

is bounded with ||A|| < 1. Of course, 
hence 

en{Va,a) < e„(Fa,a) ||A|| < e„(K.,^) (4.15) 

completing the proof of the first part. 

The second part is proved by exactly the same arguments. Indeed, (j4.12p implies 

log N{T,d,e) <c£-'' . 

An application of Proposition 14.31 yields now 

en{Va,a) = en{W o Z) < en{W)\\Z\\ < 2en{W) 

and the estimates follow by the second part of Proposition 14.21 via (j4.15p . □ 

Remark: The critical case a = p' \s excluded in the second part of Theorem 14.41 This is due 
to the fact that in that case only weaker estimates for e„(aco(i?iy))) hence for CniW) and also 
for eniVa^a) a™ available. Indeed, using Corollary 1.4 in \J\ it follows that ()4.12p only gives 

en{Va,a ■■ h(T) ^ ig{T)) < c'n-^/" logn . 

But the results in |13j suggest that the right order in that case is n"^/", i.e., the above estimate 
probably contains an unnecessary extra log-term. 
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5 Lower Entropy Estimates 

We start with a quite general lower estimate for weighted summation operators on trees. 

Proposition 5.1 Suppose there are m pairs of elements ti,Si in T possessing the following 
properties. 

1. It holds ti -< Si and {ti,Si\ Ci {tj,Sj] = for 1 < i, j < m, i ^ j . 

2. For some e > we have d{ti, Si)>£, 1 < i < m. 
Then this implies 

with some c = c{q) whenever logm < n < m. 

Proof: The strategy of the following construction consists of "inscribing" the well studied 
identity operator from into £^ into our operator V^,(j- 

The definition of the metric d implies the existence oi Vi e T such that ti <Vi^ Si and 

a{ry I cr{vi) > e , 1 < i < m . 

By assumption the intervals Jj := {ti,Vi], I < i < m, are disjoint subsets of T. 
Next define elements yi G ^i(r) by 

X X 1 ^ • ^ 

Vi ■= Ovi 77T ^ti, l<i<m , 

cr[ti) 

as well as an operator I : if' ^ h{T) by setting 

I{6i) := yi, l<i<m . 

Here Si is the i-th unit vector in if = . . . ,m}). Then a{vi) < a{ti) implies < 2, 

hence ||/|| < 2 as well. 

The image Zi G £q{T) of yi w.r.t. Va,a equals 

Zi ■■= Va,a{yi) = (^{vi) ^ a{r) 5r, l<i<m . 

In particular, the support of Zi is contained in Jj. 
Finally, let 
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and 



By the choice of the /3j we obtain \\bi\\^, = 1. Moreover, since the order intervals J, are disjoint, 
it follows that 

{zi,bj)=0, l<ij<m,iy^j, 
while ^ 

{zi,bi) = a{vi) I airy] >e, 1 < i < m . (5.1) 

If P : lq{T) ^ £^ is given by 

it holds ||P|| < 1. Indeed, if 2; € ^q(T), then it follows that 

mm m 
i=l i=l i=l 

as claimed. 

Summing up, 

PVa,aI{^i) = i^i^bi) 6i , l<i<m, 

and because of (|5.1|) we obtain for the identity Id^ from into ^™ 

Id„ = A O {PVa,aI) 

with a diagonal operator A satisfying || A : £™ 1— )• < e^^. Consequently, we arrive at 

e„(Id^ : C ^ C) ^ e'^eniPVa,aI) < 2e-i e„(14,<,) . 
To complete the proof note that a result of Schiitt (cf. [18]) asserts that 

'log(l + m/n) ^ ^^'^ 



n 



as long as log m < n < m. □ 

In order to apply Proposition 15 . 1 1 we have to find sufficiently many order intervals (tj, Sj] pos- 
sessing the properties stated above. The next result shows that we can find at least A^(r, d, 2e) — l 
such intervals. 

Proposition 5.2 Let e > be given. Then there are at least N{T, d, 2e) — 1 order intervals 
(ti, Si] such that d{ti, Si) > e and (ti, Si] n {tj, sj] = provided that i ^ j. 
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Proof: Let S = {si,...,s„} C T be a maximal 2e-distant set, i.e., it holds d{si,Sj) > 2e 
whenever i ^ j. Since S is chosen maximal, for any t ^ T there is an Sj E with d{t, Si) < 2e. 
Thus S is a 2e~net and, consequently, it follows that n > N{T,d,2e). Among all elements in 
S there is at most one Si with d{0, Si) < e. Thus, by changing the numeration we may assume 
d{0,Sj) > e for 1 < j < n — 1. For each such sj G S we define now tj -< Sj as follows: It holds 
d{tj,Sj) > e, but if tj <t < Sj, then d{t,Sj) < e. Such tj exist (and are uniquely determined) 
by d(0, Sj) > £ and by the monotonicity property of d. We claim now that the order intervals 
(ti, si], . . . , {tn-i, Sn-i] possess the desired properties. By the construction d{tj, sj) > e and it 
remains to prove that the intervals are disjoint. Assume to the contrary that there is some t in 
(tj. Si] n (tj, Sj) for certain i / j. Then it follows d{t, Si) < e as well as d{t, Sj) < e by the choice 
of the tj. This implies d{si, Sj) < d{t, Si) + d{t, Sj) < 2e which contradicts the choice of the set 
S and completes the proof. □ 

Let us state a first consequence of Propositions 15.11 and 15.21 
Theorem 5.3 Suppose that for some a > and b >0 we have 

N{T,d,e) >ce-''\loge\'' . (5.2) 

Then this implies 

en{Va,a : h{T) ^ £,{T)) > cn-i/"-W (log n)V- (5.3) 
with a constant c = c{c, q) . In particular, if 1 < q <2, then 

Ar(r,(i,e) w e'"|loge|^ (5.4) 

implies 

en{Va,a : 4(T) ^ 1,{T)) ^ c n'^ ' I"' {log nf'^ . 

Proof: In view of Proposition 15.21 the assumption implies that there are m disjoint order 
intervals (tj, , Sj] with d{ti, Si) > e. Hereby we may choose m of order |loge|^. Next we apply 
Proposition 15.11 with n = m and obtain 



log 2 



n 



en{Va,a) >ce('-^] > 5 n"!/'^" V?' (log n)"/" . 



This completes the proof. □ 
Remark: 

(1) Note that by (|3.3p in (j5.2p as well as in (j5.4p the covering numbers N{T, d, e) may be replaced 
by the order numbers N{T,d,e). 

(2) It remains open whether or not in (|5.3p the expression n"^/''^^/'?' may be replaced by 

^lignever 2 < q < oo. For those q remains a gap between the upper estimate in 
Theorem 14.41 and the lower one in Theorem 15.31 
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Our next objective is an application of Propositions 15.11 and 15.21 in the case of rapidly in- 
creasing covering numbers. 



Theorem 5.4 Suppose that 

log N{T,d,e) > ce-" 
for a certain a > 0. Then this implies 

en{Va,. : ^i(T) ^ 1,{T)) > cn-'/'^' (logn)'/'^'-'/- (5.5) 

provided that a < q' . On the other hand, if q' < a, then it follows that 

en{Va,a : h{T) ^ £,{T)) > cn-^l'^ . (5.6) 

Proof: First observe that Proposition 15.21 implies the existence of m disjoint order intervals 
(ij, Si] with d{ti, Si) > £ where the number m satisfies logm ~ e~'*. 

So let us prove (j5.5p . We use Proposition 15.11 with n ~ y/rn and note that the choice of m 
and n implies e ~ (logm)^^/" ~ (logn)"^/". Of course, logm < n < m, thus Proposition 15.11 
applies and leads to 

e„(K.,.) > ce(i^^^i±^y^' >c(logm)-Va+i/.'^-W 

> c(logn)-^/''+^/^'n~^/'^' 

as asserted. 

Inequality (j5.6p follows by similar arguments. The number m is chosen as before but this 
time we take n of order logm. This implies e ~ and we get 

e„(K.,.) > ce >ce> cn^^'^ 

as asserted. □ 

Remark: Note that (|5.5p as well as ()5.6p are valid for all a > 0. But for a < q' the first 
estimate is better while for a > q' the second one leads to a better lower bound. 

6 Examples of Upper Entropy Estimates 

The aim of this section is to apply the previous results for weights and trees satisfying certain 
growth assumptions. We start with assuming that there is a strictly decreasing, continuous 
function ip on (0, oo) with 

/•oo 

/ ^{x) dx < OO 
Jo 

such that for some fixed q < co holds 

{a{t)a{t)y < V{\t\) , teT. (6.1) 
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The next objective is to construct order e-nets on N for a metric generated by Later on those 
nets on N lead in natural way to nets on trees. Given 93 as above define ^ on [0, cxd] by 



poo 

^>(y) := / ip{x)dx, < y < 00 , 



(6.2) 



and $(00) := 0. The generated metric d on [0,oo] is then defined by 

d{yi,y2) := ^(yi) - ^'(^2) = / ^{x)dx (6.3) 

■Jyi 

provided that yi < y2- Given e > we construct a 2e-net for (N, d) as follows. First we take 
all points in N up to the level ip^'^{e), i.e., as a first part of the net we choose 

Me := {n > 1 : n < (^"^(e)} = {n > 1 : Lp{n) > e] 

and note that < ^'^{e). 

It remains to find a suitable 2e-cover for {n > 1 : n > 93"^ (e)}. Here we proceed as follows. 
For A; = 1, . . . ,iV set 

Uk:=^-\ke) (6.4) 

where the number is chosen as 



max < A; > 1 : A; < 



iV := max{A; > 1 : {tfe > V3"-^(e)} = max {A; > 1 : /ce < $((^~^(e))} 

Note that ui > U2 > ■ ■ ■ > un and, moreover, since in that region •^{x) < e we necessarily have 
Uk-i — Uk > I, k = 1, . . . , N . Hence, setting (here [u] denotes the integer part of n G M) 

Uk-=[uk], k = l,...,N -1 , 

it follows that ui > ui > U2 > ■ ■ ■ > un-i > un- It remains to define un- If [un] > ^p~^{e) 
we set uat := [un\. Otherwise we take un := [ip~^{e)] + 1. By the construction it follows that 
d{uk,Tn) <2e for all m G N with < m < u^^i where uq := 00. Consequently, the set 

Se:= M,U{ui,...,un} (6.5) 

is a 2e"net of (N,(i). 

Next we want to apply the preceding construction to build suitable e-nets on trees. Recall 
that R{n) denotes the number of elements in the n-th generation of a tree. 

Proposition 6.1 LetT be a tree such that R{n) < p{n) for a certain continuous, non-decreasing 
function p on [0,oo). Furthermore, suppose that the weights a and a on T satisfy i\6.1\i for a 
certain q > 1 and some function ip as before. Define the metric d as in l\3.1\i with a, a and q. 
Then it follows 

N{T,d,e)< p{x)dx + p{^-\ey2)) + 2e-'i p{yMy)dy 

Jo Jip-^(£i/2) 



where ^ is as in (fSHI)- 
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Proof: Assuming ()6.ip it follows (recall that a is non-increasing) that for all t ^ s in T 

1/9 



d{t, s) = max 



Lax |( ^ a{ryy^\iv)\ < ( J] (a(rMr))^) < ( c^(|r|) 

^ J. ^ I ^ I t 



|t|<fc<|s 

Hence, if = U {ui, . . . , uat} is defined as in (|6.5|) . setting 

Se := {t e T : \t\ e Se^} (6.6) 

we obtain an order 2-^/''e-net for {T,d). 

To proceed further we have to estimate H^Ss suitably. In view of R{n) < p{n) we get 

N N 

#Se < Yl Pin) + Ypiuk)< Yl Pin) + Yp{uk) 

n<ip-^(e^) k=l n<ip-'^{ei) k=l 

N 

n<(p-i(£9) fc=l 

Since p is non-decreasing and po (^^^ non-increasing, this leads to 

#Se < / p{x)dx + p{^'^{e'i))+ / p{^-^{xe'i))dx 

p{x)dx + p{^~\e'^))+e-'' / p{yMy)dy. 

Jip-^(ei) 

Finally, we use N{T,d,2^/'^e) < #Ss and replace by e'^/2. This proves the proposition. □ 
One can slightly simplify the bound for subsequent use as follows. 

Corollary 6.2 

1. Convergent case: 

Suppose that p{y) ip{y) dy < oo. Then it follows that 



N{T,d,e)< p{x)dx + p{<^-\ey2)) + 2e-'i p{yMy)dy . (6.7) 



2. Divergent case: 

V ir P(y) ^(y) dy = oo , then we get 

N{T, d, e) < p{x) dx + p($-i (eV2)) + 2 e""? pivMv) dy • (6.8) 

Let us state and prove a first application of Proposition 16.11 in the case of moderate trees, 
i.e. those where the number of elements in the generations increases at most polynomially. 
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Proposition 6.3 Let T be a tree such that R{n) < cn^ for some H >0. Suppose, furthermore, 
that 

a{t)a{t) < c\t\-^/'^ , teT, 

for some 7 > 1. Then it follows 

e ^-1 : 7 < fl" + 1 

N{T,d,e)<c{ e-nog{l/e) : ^ = h + 1 

e -I : 7 > H + 1. 

Proof: First we note that in all three cases the behavior of the first and the second term in 

q(H + l) qH 

(j6.7p or (j6.8p is e ^ and e t-^ , respectively. Only the third term behaves differently in 
each of the three different cases. 

Thus let us start with the investigation of this third term in the convergent case, i.e., if 
7 > + 1. We use ()6.7p and observe that the third term behaves as the first term, i.e., as 

q{H + l) 



Since here < the second term in (j6.7p is of the smaller order and we obtain 

_q(H+l) 

N{T,d,e)<ce ^ 

as asserted. 

Next assume 7 = H + 1. This is a kind of divergent case and the third term in (|6.8p is of 
order £~'^ log(l/e), while the first and the second term are of lower order £~'^, and we get 

NiT,d,e) < ce-nog{l/s) 

as claimed above. 

Finally, suppose 7 < H + 1. This is again a divergent case and the third term in ()6.8p behaves 

like 

thus the second and the third term are of the same order. Since for 7 < H+1 we have > 

q(H+i) 

the first term that behaves like e 1 is of smaller order. Thus it follows that 

qH 

N{T,d,£) < ce 7-1 

which completes the proof. □ 
An application of Theorem 14.41 to the above estimates leads to the following. 

Theorem 6.4 Suppose 1 < q < 00 and let as before p := min{2,q}. Suppose that the tree T 
satisfies R{n) < cn^ for a certain H >0 and that 



a{t)a{t) < c\t\~^/'^ , teT, 
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for a certain 7 > 1. Then we may estimate the entropy numbers of the weighted summation 
operator Vaa <is follows: 



7-1 1 

Jl qH j7 

_1_ 1 

n 1 7 (log n) 

J 1 

fl q{H+l) p' 



en{Va,a : ii{T) ^ £g{T)) < c { 
Remark: Note that p = q for 1 < q < 2. In particular, in that case 



■y < H + 1 
7 = H + 1 
7 > H + 1 



7-1 1 



n 9( 



7 1 

g(-H+l) 7 



7 < + 1 
7 = H + 1 

7 > H + 1 



Our next objective is to investigate weighted summation operators on binary trees. Here we 
have p{x) = 2^. Let us first suppose that the weights decay polynomially, i.e., we suppose 

a{t)a{t) < c\t\-"'/'' , teT, 

for some 7 > 1. Of course, in order to estimate N{T, d,e) we have to use the divergent case of 
Corollary 16.21 Then we get 

log / p{x) dx ^ e ''''^ and 

Jo 

logp($"i(eV2)) ^ £-9/(7-1) _ 

Furthermore, as can be seen easily the logarithm of the third term in (|6.8p behaves like e-'j/^^-i) 
as well. 

Summing up, it follows that 

logiV(r,d,e) < ce-'?/^^-^) . 

Hence we see that the critical case appears if 9/(7 — I) = p' (recall that p = min{2, q}), i.e., in 
the case 

7 = g if 1 < q < 2 and 
7 = g/2 + 1 if 2 < g < 00 . 

In the non-critical cases we get the following. 

Theorem 6.5 Let T be a binary tree and suppose that 

a{t)a{t) < c\t\-^^'i , teT, 

for a certain 7 > 1 with 7 7^ q/p' + 1. Then this implies 
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a) for 1 < q< 2: 

n i' (log n) 1 



eniVa,a : hiT) ^ iqiT)) < c 
h) for 2 < g < oo; 



7 > g 

7-1 

n 9 : 7 < 



1 1 7-1 

n^2(logn)2 9 : 7 > | -|- 1 

7-1 

n <? : 7 < I -M . 



2 

Remark: For one-weight operators, i.e., if a{t) = 1, t G T, and for q = 2 the preceding result 
was also proved in |13j . Moreover, it was shown there that the above estimates are sharp. But 
the main result in [13] is the investigation of the critical case 7 = 2 if q = 2. As mentioned 
above the general results for the entropy of the convex hull in [7^ lead only to 

en{Va,a ■ h{T) ^ £g{T)) < cu'^ log n 

in the critical case 7 = q/p' + 1. 

Let us shortly mention a third example. Again we take a binary tree T, but this time the 
weights decrease exponentially, i.e., we assume 

a{t)a{t) < c2~9'*' , t£T, 

for some 7 > 0. Hence we have (^(x) = 2""^^ and 

(^-i(e'?)~c^.-i(e'?)~^log2(l/e). 

7 

Thus all terms in ()6.7p and ()6.8p are of the same order e~'^^"' and under these assumptions 

iV(T,d,e) < ce-''/7 . 

Thus, it follows 

en{Va,^ : ii{T) ^ UT)) < cn^'^-V (6.9) 
in that case. In completely different probabilistic language, this example was studied in [1]. 



7 Examples of Lower Entropy Estimates 

In Section [6] we proved upper estimates for N[T, d, e) under certain growth assumptions for 
the weights and for R{n), the number of elements in the n-th generation of T. The aim of 
this section is to prove in similar way lower estimates for N{T,d,e) or en{Va,a), respectively, 
assuming lower growth estimates. Thus we investigate weights satisfying 

{a{t)a{t)y > ^{\t\) , teT, (7.1) 
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for a function ip as in Section [6] and, furthermore, we assume 

R{n) > p{n) , n e No , (7.2) 

where p is as before non-increasing and continuous with p(0) = 1. 
Under these assumptions we get the following. 

Proposition 7.1 Assume <\7.1^ and \ 7.S^. Then we have 



N{T,d,e/2) > I p{x)dx . 



Proof: Fix e > and set 

Te:= {teT ■.0<\t\< if'^ie'^)} ■ 

Given s € T, s ^ 0, let s' be the parent element of s, i.e., s is an offspring of s'. Then (|7.1|) 
implies 

d{s',s) = a{s)a{s) > (/^(Isl)^/" > e 

provided that s £ T^. Let now t,s G with t ^ s. If t ~< s, then t ^ s' ~< s, hence 
dit, s) > d{s' , s) > e. Otherwise, i.e. if t and s are incomparable, by the same argument we get 

d{t,s) > d{t As,s) >e 

as well. Consequently, is an e-separated subset of T which implies 

N{T,d,e/2)>#T,. 

Thus, in order to complete the proof it suffices to estimate ^^T^ suitably. Here we use ()7.2p and 
obtain 

0<n<</3-i(e9) 0<n<(p-i(e9) 

as asserted. □ 
Corollary 7.2 Suppose that 

J p{x)dx>ce "|loge| 

for certain a > and b > 0. Then this implies 

en{Va,a : h{T) ^ £g{T)) > cn-i/'^-i/^'Oogn)^/'^ . (7.3) 
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Proof: Using Proposition 17. II the assumption leads to 

N{T,d,e) > c'e-'^|loge|^ . 

Consequently, Theorem 15.31 applies and proves (|7.3p . □ 

Let us apply the preceding corollary for concrete functions if and p. We start with the 
investigation of moderate trees and polynomial weights, i.e., p is of polynomial growth and 
fix) > cx~^ for a certain 7 > 1. Here we get 

Proposition 7.3 Suppose that T is a tree with R{n) > cn^ for some H > 0. Furthermore 
assume 

a{t)a{t) > c\t\~^/'^ , teT, 
for some 7 > 1. Then it follows that 

en{Va,a : h{T) ^ £g{T)) > . (7.4) 

Proof: This follows directly from Corollary 17.21 by evaluating the integral. □ 

Remark: Suppose 1 < g < 2. Then the preceding proposition shows that the estimates in 
Theorem 16.41 are sharp provided that 7 > H + 1. We will see later on that this is no longer 
always true if 1 < 7 < + 1. 

Another application of Proposition 17.11 leading to sharp lower estimates is as follows. 

Proposition 7.4 Let T be a binary tree and suppose that 

a{t)a{t) > c2"9l*l , t£T, 

for some 7 > 0. Then this implies 

en{Va,a : h{T) ^ ig{T)) > cn-^-V . 

Proof: Again this is a direct consequence of Corollary 17.21 and the fact that 

7 7 

Recall that p may be chosen as p{x) = 2^ in that case. □ 



Remark: Combining the preceding proposition with (|6.9p gives the following: Let T be a 
binary tree and suppose 1 < g < 2. If 

a{t)a{t) w 2"^'*' , t G T , 

then 

en{V^,..h{T)^£g{T))^n--^-7 . 
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As we said above, Proposition 17.31 does not always lead to sharp lower estimates, even if 
1 < q < 2. The reason is that here the structure of the underlying tree plays a role. We assume 
now that ^(t) > 1 for each t € T. In different words, we suppose that each element in T has at 
least one offspring. Furthermore we restrict ourselves to one-weight operators defined as follows. 
We write Va instead of Va^cr provided that a = 1, i.e., Va denotes the one-weight operator acting 
as 

{Vax){t):=a{t)Y,x{s), teT. (7.5) 
In the case of a one-weight operator condition (j7.1|) reads now as 

a(t) > ter. (7.6) 

To proceed further we have to exclude functions (p decreasing too fast. Thus we assume that 
there is a constant k > 1 such that 

(p{x) < Kip{2x) , X > xq . (7.7) 

Let $ be defined as in ()6.2p . For later use we mention that ()7.7p implies 

^(x) (p(2x) 

— TT ^ ^ — ^ ^ ) X > xo , 

lf{x) Lp{x) 

hence 

> K V (y) as y^O. (7.8) 



y 

Under these assumptions we get the following general lower estimate. 



Proposition 7.5 Let T be a tree with ^(t) > 1 for each t & T and suppose (7.6) as well as 
l\7.7\). Then there is an Eq > such that 



/.$-i(8e'?) 

iV(T,d,e/2) >4-V / piyMy)dy (7.9) 



whenever < e < Eq. 

Proof: First note that for one-weight operators the metric d reduces to 



whenever t ^ s. By ()7.6p this implies 



d(,t,s)>[ Yl '^(^)) ' >d{\t\ + l,\s\Y^' (7.10) 

k=\t\+l 
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with d defined in (j6.3p . 

We construct now positive real numbers tti > n2 > • • • > uat as in ()6.4p but this time directly 
with e'' instead of e, i.e., 

:= , 1 < A; < iV , 



where satisfies 



Next set 



where 



Using ()7.8p this implies 



N < ^ " <N + 1. 

el 



Vk ■= [u3k] + 1 , 1 <k <m, 



m 



N 



> 



3ei 



,n>iK^ (7,11) 



4e9 

provided that e < Eq for a certain eq depending on ip. 
Since Uk — Uk-i > 1 we get 

[u3k-2,U3k-3] Q [vk + l,t^fc-l] , 

hence 

d{vk + l,vk-i)>e'^ . (7.12) 
Let us construct now an e-separated subset C T as follows. For 1 < A; < m set 

Tk:={teT: \t\ = Vk} 

and given t £ Tk with 2 < k < m we choose exactly one Sk~i{t) £ Tk~i satisfying Sk-i{t) >~ t. 
Those Sk-i{t) exist because we assumed ^(t) > 1 for all t £T. Finally, define by 



Se ■■= U {sfc-lW -t^Tk} . 



k=2 



Because of (jT.lOp and (|7.12p the points in Se are e-separated and since u^k < ^k the properties 
of p yield 

m m m m m 

A/ — 2 /c — 2 /i/ — 2 /i/ — 2 /c — 2 

Clearly this implies 

m 

N{T,d,e/2)>Y,p{^-\Zke'^)). 

k=2 
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Observe that po ^ ^ is decreasing and recall ()7.1ip . Then we get 

N{T,d,£/2) > / p{^-^{3xe'^))dx> p{^-^ {Axe")) dx 

J2 J2 

= / p{y)<f{y)dy 

as asserted. □ 

Remark: Unfortunately, we do not know whether or not an estimate similar to (|7.9p remains 
valid in the case of two weights a and a satisfying (|7.ip . The crucial point is that in this case 
estimate (jT.lOp is no longer valid. For example, take ^{x) = x~'^ for some 7 > 1 and choose the 
weights as a{t) = 21*1/'? and a{t) = |t|-T/92"l*l/« to see that ([710]) is not satisfied in general. 

A first application is for moderate trees with polynomial decay of the weight a. It shows that 
the estimates in Theorem 16.41 are also sharp (at least for one-weight operators and 1 < g < 2) 
for 1 < 7 < H + 1, provided we have the additional assumption ^(i) > 1 for t G T. 

Proposition 7.6 Let T be a tree with ^{t) > 1 for t G T such that R{n) > cn^ . Given 7 > 1 
let a{t) > c\t\^'''^'^ . Then, if ^ < H + 1, it follows that 

N{T,d,e) >ce~^ (7.13) 

Similarly, i/7 = _ff + 1, then 

N{T, d, e) > ce"'? log(l/e) . (7.14) 

For the operator Va we have 



eniVa ■■ iiiT) ^ i,{T)) > c 



7-1 1 

n iH ^' : 7 < H + 1 



n ^ilogny/'^ : 7 = + 1. 

Proof: Of course, v'(x) = c^x~^ satisfies condition (|7.7p . Thus Proposition 17.51 applies and the 
lower estimates in (j7.13p and ()7.14p are direct consequences of (^^^(e"?) w e-'il^ as well as of 
^-^e'') ~ e-5/('>-i). The estimates for e„(Fo) now follow from Theorem 15. 3[ □ 

Another application of Proposition 17.51 is for binary trees and polynomial decay of a. 

Proposition 7.7 Let T be a binary tree and suppose a{t) > c\t\~'^/'^ for some 7 > 1. Then this 
yields 

log N{T,d,e) > ce^«/(^~^) . 

Proof: Again satisfies ()7.7p , hence Proposition 17.51 applies as well and the assertion easily 
follows by 

log / ' p{yMy) dy « <^-\8e'') « ^-^/(^-i) . 

□ 

Combining Proposition 17.71 with Theorem 15.41 leads to the following. 
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Proposition 7.8 Let T he a binary tree and suppose that a{t) > c\t\ '^1'^ for a certain 7 > 1. 
Then this implies 



ri-V9'(logn)i^i'/'' : > q 
n-'^'y-'^yi : 7 < g. 



8 Biased Trees 

We will now test the sharpness of our bounds on an interesting class of trees whose branches, 
opposite to the case of Proposition 17.61 die out quickly. Let H > 1. We define a biased tree of 
order H as follows. Take a binary tree, draw it on the plane so that it grows from the bottom 
to the top, and for any level n > keep only the R{n) rightmost nodes where 

I n^, n > 2h. 

The set of nodes we have kept is a tree since 

R{n + 1) < 2R{n) , n>0. 

We call this tree a biased tree (because it is really biased to the right) of order H and denote it 
by Tff. Since the size of its n-th level for large n is , the biased tree satisfies both the upper 
and lower size bounds 

cn^ < R{n) < Cn^, (8.1) 

as in Theorem 16.41 and in Proposition 17.31 respectively. At the same time the nodes situated on 
large levels die out pretty quickly, which enables more efficient covering constructions than in 
the general case. 

On Th we will consider the usual one-weight operator Va defined in (j7.5|) . Recall that 
N{T}{,d,e) stands for the order covering numbers of the tree Tjj defined in (j3.2|) . Our main 
result for biased trees is the following. 

Proposition 8.1 Let Va be the one-weight operator on Tjj with the weight a{t) = \t\~"'^'^, 
7 > 1, and let d be the metric on Th corresponding to this weight. For the related order covering 
numbers we have 

ce ~ < N{TH,d,e) <Ce (8.2) 
For the entropy numbers of we have 

cn"^(fe"7 < e„(y„) < Cn"^(fe"^. (8.3) 

This shows that the lower bound (j7.4|) of Section [7] can not be improved in general, unless 
we make some extra assumptions about the tree - as in Proposition 17. 6[ 

We also see from this bound that the upper estimates for order covering numbers obtained 
in Proposition 16.31 and those for entropy numbers obtained in Theorem 16.41 are not sharp for 
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certain trees in the convergent case 7 < H + 1 and in the intermediate case 7 = H + 1, while 
the results of Sections [6] and [7] show that in the divergent case (7 > + 1) the estimate for the 
entropy numbers is sharp for any tree satisfying ()8.ip . 



Now we start proving Proposition 18.11 
Proof: The construction will be based on the same set of levels as in ()6.5p but we specify it for 
our situation. Let 

poo 

$(y) = / x-^dx = ct/~(^"^\ (8.4) 



y 



We will use the following elementary property. For any positive integers n < m it is true that 



m 



^ k-^ < x-^dx = ^>(n) - $(m). (8.5) 

k=n+l 

Given e £ (0, 1), let J = [e^''/'''] and choose a decreasing sequence of integers (?^j)i<j<j, by 

rij := inf{n G N : <I>(n) < je'^}. 
We also let no := +00 for uniformity of further writing. By (|8.4|) we have 

In particular, we have 

nj<Ce~^. (8.6) 

Now we define our order net as follows: Se := 5^ U 5^, and := {s : \s\ < nj}, whereas 

J 

'S'e := 1^ Sej, 

i=i 

and Sej consists of the first := min{c*e 1 , R{nj)} rightmost nodes of the level nj. The large 
constant will be specified later. Recall that in the construction we used to prove Proposition 
16.11 we included in the net the entire levels, see ()6.6p . Due to the structure of the biased tree, 
only a small part of the level suffices, thus the net is more efficient. 
The size of the net is bounded by 

nj J nj ^ ^ 

#Se < ^i?(n) + ^i/j <C^n-f^ + J-c*e"V < C7nJ+^ + J-c,e"V 

n=l j=l n=l 



< Ce f by ()8.6p and by the definition of J. 

In order to evaluate the precision of the net, we will use the following structural property of the 
biased tree. 

Lemma 8.2 Let j < J and let s £ T be such that \s\ > nj . Then there exists at£ S^j+i such 
that t ^ s. 



27 



Proof: First of all, notice that it is enough to consider the case |s| = nj. Indeed, for any s with 
\s\ > rij we find s' satisfying |s'| = nj and s' ^ s. Once the lemma is proved for s', we find an 
appropriate t G Sgj+i for s' and conclude from t ^ s' < s that t ^ s. 

So let us assume that \s\ = rij. Now look at I'j-i-i = min{c*e t ,i?(nj+i)}. If lyj+i = 
i?(nj+i), this means that Ssj+i coincides with the entire n^+i-th level of Th- Then of course 
there exists t £ Se j+i such that t -< s. 

On the other hand, if i^j+i = ^ , then our statement reduces to the numerical inequality 



.2. 



H 



c^e~~ ■ 2"^-"^+i >Cnf. (8.7) 

Here the left hand side is the total number of offsprings of elements in Ssj+i counted on the 
nj-th level of zhe initial binary tree and the right hand side is an upper bound for the size R{nj) 
of Tij-th level in Th- 

1 

It follows from the definition of rij that rij ~ c{je'^) ^-i , hence 



If c* is large enough, then for any x > we have 2'^^^ > Cx t . By letting here x = rije'^ we 
obtain 

hence, 

c^e-~ ■ 2"^-^^^+! > c*e"~ • 2^^^= > c*e"~ • Cc-^nfe~ = Cnf , 



and (HZl) fohows. □ 
Now the precision of the net is easy to establish. Recall that if t -< s then 

r^s |t|<fc<|si 

Next, if s ^ 5^, only the two following cases are possible. 

1) \s\ > ni. Apply Lemma [82] with j = I. We find t G 3^,2 such that t ^ s. Then by (USD 
and ()8.5p we have 

oo 

d{t,sy< ^"^ < ^K) < 26". 

k=n2+l 

2) rij < |s| < for some 2 < j < J. Apply Lemma 18.21 with j. We find a t G Ssj+i such 
that t-< s. Then by ([SSD and ([83]) we have 



fc=nj-|-i+l 

Therefore for any s G Tf/ we have a. t G such that t ^ s and d{t, s) < 2-'^/'? e. Taking into 
account the bound for #S'e, we see that N(TH,d,e) < Ce t 
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For the lower bound, take any distinct s,t £ S^, that is |,s| < nj, \t\ < nj. Then 

d{s,t) > max{a(s),a(t)} > {nj)~^/'i > ce, 
while the number of points we consider is bounded from below by 

nj-l nj-l q{H+l) 

#Sl = R{n) > c ^ >c{nj - 1)^+^ > ce''~^ . 

n=l n=l 

It follows that 

N{TH,d,e) > N{TH,d,e) > ce"^^, 

as required in ()8.2p . For the entropy numbers, the upper bound in (|8.3p follows from the upper 
bound in (|8.2p via Theorem 14. 4| while the lower bound in (|8.3p was proved in the more general 
context of Proposition 17.3^ see (|7.4p . □ 

9 A Probabilistic Application 

Due to the well known relations between the entropy of operators on Hilbert spaces and small 
deviation probabilities of Gaussian random functions, our results have immediate probabilistic 
consequences. Thus regard Va,a as operator from ii{T) into i2{T). Its dual V*,^ maps £2iT) 
into iooiT), hence it generates a Gaussian random function X = [Xt)t<^T by 

Xt := {V:,M{t) = <T{t) Y c^ir) & 

where {(,r,r G T} is a family of independent AA(0, l)-distributed random variables. The covari- 
ance structure of X is given by 

E XtXs = o-(t)a(s) Y «(^)^ ' t,seT . 

r^tAs 

Such summation schemes on trees are extensively studied and applied, see e.g. the literature on 
Derrida random energy model [2j or displacements in random branching walks |17] . to mention 
just a few. 

As one consequence of our results we get the following. 

Proposition 9.1 Suppose N{T,d,e) ~ e~"|loge|'', for some a > 0,6 > 0. Then this implies 
that 

-logP lsup\Xt\ < e) « llogel'' . 
Proof: An application of Theorem 15.31 implies 

en{Va,a : h{T) ^ l2{T)) « n-^/-"\\ognf'- . 
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Next, duality results for entropy numbers (cf. [21]) lead to 

as well. Recall that V*^ generates X, hence we may apply the classical Kuelbs-Li result (see 
[lU] or [12]) and obtain 

-logP ( sup|Xt| < e ) « e^" |loge|'' 

as asserted. □ 
Remark: By the same methods one gets that N(T,d,e) < ce~" |loge|^ yields 

-logP ( sup|Xt| < e ) < ce"" |loge|'' . 

Surprisingly, this looks exactly as a special case of a general small deviation result due to M. 
Talagrand (cf. [20] or [H]). Yet the main difference is that in the cited result one uses the 
covering numbers w.r.t. the so-called Dudley distance dx while our results are based on these 
numbers w.r.t. the metric d defined in ()3.ip . This suggests that there is maybe some relation 
between dx and d. Even if this is the case, it is at least not obvious. 

10 Concluding Remarks and Open Problems 

We must say that the study of summation operators on trees we merely initiated here is far from 
being complete. For example, recall that many of our estimates are proven to be sharp only in 
the range of the parameter q < 2 while there are gaps for q > 2. It would also be quite natural 
to consider the operators acting from ip{T) into iq{T) with general p,q £ [l,oo]- In both cases 
the reason of difficulties is that the technique of convex hulls that we refer to in Section |3] is not 
appropriate anymore and other tools are needed. 

In this context let us mention the following related open question: Given 1 < p,q < oo and 
a tree T. For which weights a and a is Va^a a bounded operator from £p{T) into iq{T) ? To 
our knowledge this is even unknown if T is a binary tree. Let us shortly recall the answer to 
this question in the case T = No (cf. [6] where it was derived from the classical Maz'ja-Rosin 
Theorem for weighted integration operators). We formulate it only in the case 1 < p < q < oo 
although the answer is known for all p,q £ [1, oo]. 

Proposition 10.1 If 1 < p < q < oo, then Va,a is bounded from ^p(No) into ^q(No) if and only 
^f 

sup ||al[o,„]|| ||crl[^^^)|| < oo . 

We shortly mention two other problems related to the presented topic. 
(1) Throughout the paper we always assumed a to be non-increasing. This property was used 
at several places. For example, it played an important role in the proofs of Proposition 13.21 and 
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15. H respectively. If a is not necessarily non-increasing, then the distance d has surely to be 
modified as 

d{t, s) := max 1 1 al(t,,] 1 1 J | al 1 1 ^ 

whenever t ^ s. Unfortunately, then, in general, d can no longer be extended to a metric on T. 
Nevertheless we believe that some covering properties of T w.r.t. d are tightly connected with 
compactness properties of Va,a- At least this is suggested by the known results for compactness 
and approximation properties of weighted integration operators as proved in [5] or jl4) . 

(2) A challenging problem is the critical case as treated in Theorem 14. 4i Some partial results 
related to the critical case are known. For example, in [13] the problem is solved for the binary 
tree provided that q = 2 and a{t) = 1. Other results in the critical case we are aware of are 
based on [3] and will be handled in a separate publication. 
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